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ABSTRACT.  Two  coding  problems  are  considered 
One  is  to  obtain  binary  codes  with  desirable 
auto-correlation  functions.  The  other  is  to 
obtain  families  of  such  codes  with  desirable 
cross-correlation  functions.  The  results  of 
a  random  computer  search  for  the  codes  are 
described.  An  approximation  to  the  expected 
number  of  trials  in  a  search  for  families  of 
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FOREWORD 


This  report  describes  the  results  of  a  random  computer 
search  for  binary  codes.  The  search  was  motivated  by  con¬ 
sultation  with  T.  A.  Wcstaway  of  the  Electronic  Systems 
Branch,  Systems  Development  Department,  on  the  problem  of 
obtaining  binary  codes  Of  length  100  with  desirable  correla¬ 
tion  properties.  In  anticipation  of  future  problems  in  this 
area,  data  were  obtained  on  codes  of  other  lergths. 

The  work  reported,  as  well  as  the  preparation  of  the 
report,  was  supported  by  the  Naval  Air  Systems  Command  Task 
Assignment  A- 04 - 5 35- 20 7/ 216-1/F099- 0  5 -0 2 . 

This  report  is  released  at  the  working  level.  Because.' 
of  the  continuing  nature  of  the  research  coding  problem, 
refinements  and  modifications  may  later  be  made  in  this 
study . 
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INTRODUCTION 


In  electronic  communications  problems  one  is  sometimes 
interested  in  obtaining  finite  length  sequences  of  l's  and 
- 1 T  s  with  small  auto-  and  cross-correlation  functions.  One 
way  of  acquiring  such  sequences,  or  families  of  such  se¬ 
quences,  is  to  generate  them  randomly  on  a  high-speed  digital 
computer  recording  the  desirable  ones  and  discarding  the  rest. 
This  report  describes  the  results  of  such  a  random  search 
with  emphasis  on  sequences  of  length  100. 


THE  SET  OF  SEQUENCES 


Let  Xn  be  the  set  of  all  2n  sequences  of  l’s  and  -l’s 
of  length  n.  Let  x  denote  the  sequence  ( Xj  , x2 , . . . , xn) . 
For  x,  y  e  Xn  let 


cR(x,y) 


r^Xiy^n  :  1  ~  1  ~  k)  if  l§k£n 

'-S{x.,,  y.  :  1  S  i  §  2n-k} 

1  l+k-n7!  J 


if  n+1  s  k  S  2n~l  (1) 


For  1  S  k  £  n,  Ck(x,y)  is  the  value  of  the  correlation 
of  the  left-most  k  elements  of  x  with  the  right-most  k 
elements  of  y.  For  n+1  S  k  s  2n-l,  ck(x,y)  is  the  value 
of  the  correlation  of  the  right-most  2n-k  elements  of  x 
with  the  left-most  2n-k  elements  of  y. 


The  ( 2n-l)  -tuple  c(x,y)  =  (c^Xjy),  c2(x,y),  ..., 
c2n-i(x>y))  of  integers  is  called  the  cross-correlation 
function  of  x.  The  following  integer -valued  functions  on 

:  1  s  k  5  2n-l}  if  x  /  y, 

:  1  s  k  S-  2n-l,  k  /  n) 

if  x  =  y  (2) 


a  xa  win  De  or  interest, 
n  n 


f  max  {  I  c,  (x,y) 
M(x,y)  =  ] 

1  max  {  I  c,  (x,y) 


1 
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For  simplification  we  let  m(x)  =  M(x,x).  Estimates  of 
the  mean  of  the  function  m,  and  (rm>  the  standard  devia¬ 
tion  of  m  for  values  of  n  ranging  from  10  to  1,000  were  ob¬ 
tained  by  random  sampling  in  the  spaces  Xn.  Table  1  lists  for 
each  n  the  mean  and  standard  deviation  of  the  sample  from 
Xn7  as  well  as  a  measure  of  the  difference  of  the  mean  from 
na/3.  It  is  interesting  to  note  how  closely  n2/3  approxi¬ 
mates  the  sample  mean  for  n  between  40  and  300. 


TABLE  1. 


n 

Sample 

size 

^ m 

n2/3 

-2/ 3 

% 

a 

m 

n2/3 

10 

500 

4.24 

4.64 

-0.086 

1.31 

20 

500 

6.97 

7.37 

-0.054 

1.79 

30 

500 

9.28 

9.66 

-0.039 

2.12 

40 

500 

11.49 

11.70 

-0.018 

2.61 

50 

500 

13.54 

13.57 

-0.002 

2.86 

60 

500 

15.49 

15 .33 

0.010 

3.25 

70 

500 

17.05 

16.99 

0.004 

3.4  2 

80 

500 

18.54 

18.57 

-0.002 

3.67 

90 

500 

I  20.17 

20.08 

0.004 

4.00 

100 

500 

21.53 

21.54 

0.000 

4.05 

110 

500 

22.91 

22.96 

-0.002 

4.32 

120 

500 

24.49 

24 . 33 

0.007 

4 . 30 

130 

500 

25.69 

25.66 

0.001 

4 .53 

140 

500 

27,22 

26.96 

0.010 

4.58 

150 

500 

28.32 

28.23 

0.003 

4 . 74 

200 

500 

33.74 

34 . 20 

-0.013 

5.07 

300 

500 

44.19 

44.81 

-0.014 

6.28 

500 

300 

60.02 

63.00 

-0.047 

7.68 

1,000 

200 

91.69 

100.00 

-0.083 

11.35 

SELECTING  SINGLE  SEQUENCES 


The  first  practical  problem  motivating  this  s  :uc.y  was  to 
find  sequences  x  e  X100  such  that  m(x)  is  small.  Genera¬ 
tion  of  500  random  sequences  indicated  that  the  probability 
that  m(x)  £  15  is  about  0.026.  Among  the  sequences  chosen 
none  satisfied  m(x)  £  10.  On  the  UNIVAC  1108  computer  500 
members  of  X10o  can  chosen  and  the  corresponding  values 
of  m  computed  in  a  little  less  fhan  one  minute.  If  one  is 
interested  in  obtaining  only  sequences  x  such  that  m(x) 
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is  less  than  or  equal  to  some  fixed  bound  b,  then  one  dis¬ 
cards  the  sequence  x  immediately  after  computing  the  first 
Ck(x,x)  such  that  icj^CXjX)!  exceeds  b.  This  eliminates 
unnecessary  calculation  of  the  Ck(x,x)  for  unacceptable 
sequences  thus  reducing  the  running  time  on  the  computer. 

Of  course,  if  bj  £  ba ,  then  the  number  of  sequences  satis¬ 
fying  m(x)  £  b.  will  be  less  than  the  number  satisfying 
m(x)  £  b2 .  Table  2  shows  the  results  of  computer  runs  select¬ 
ing  sequences  with  upper  bounds  of  13,  15,  and  20  for  m(x) 
in  the  set  X100‘  In  the  case  of  the  80  sequences  found  with 
an  upper  bound  of  13,  none  had  upper  bound  £  10. 


TABLE  2. 


Bound 

No.  of  sequences 
tested 

No.  of  good 
sequences 

Running 

time 

13 

30,000 

SO 

10  min,  31  sec 

15 

5,000 

150 

3  min,  1  sec 

20 

900 

374 

2  min,  48  sec 

SELECTING  FAMILIES  OF  SEQUENCES 


The  second  practical  problem  under  consideration  was  to 
obtain  a  family  x<1 >,  x<2>,  ...»  x(f)  of  sequences  from 
X100  such  that  M(x( 1 ) , x< J ) )  is  less  than  some  bound  b 
for  1  £  i  £  j  £  f.  Here  one  :1s  confronted  with  the  problem 
of  determining  values  of  f  and  b  for  which  the  computer 
will  generate  a  solution  in  a  reasonable  length  of  time. 

Suppose  the  probability  that  m(x)  £  b  is  p  for  a 

random  sequence  x.  The  value  m(x)  is  the  maximum  of 
the  first  99  values  of  jc(x,x)j  since 

ckCx’x)  =  cs„0-k<x'x)  O) 

i.e.,  the  function  c(x,x)  is  symmetric  about  k  =  100. 

However,  for  M(x,y)  to  be  less  than  or  equal  to  b,  one 

must  have  |c^(x,y)|  £  b  for  1  £  k  £  199.  If  one  assumes 
that  the  functions  cx  ,  c2 ,  ...,  c199  are  statistically 
independent,  then  the  probability  that  M(x,y)  £  b  for  two 
random  sequences  x,  y  is  approximately  p2 . 
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Now  suppose  one  wishes  to  select  randomly  a  family 
X*1),  x<2),  ...»  of  sequences  from  X100  as  described 

above.  First  one  chooses  sequences  until  a  sequence  x<l) 
satisfies  mCx^1 ))  i  b.  Next  one  chooses  sequences  until  a 
sequence  x^2  )  satisfies  M(x(l),x*20  £  b  and  m(x^2  O  S  b. 
Continuing  recursively  one  chooses  at  stage  r  a  sequence 
x(r)  satisfying 

M(x(1 } ,  x(r))  S  b 
M(x(2 ) ,  x(r))  S  b 


MCx*1'-1*,  x(r))  S  b 

m(x(r))  £  b.  (4) 

Having  chosen  sequences  x(1),  x{2),  .  ..,  xir_1)  the 
probability  that  a  randomly  chosen  sequence  x  satisfies 
the  restrictions  of  Eq.  4  on  x(r)  is  approximately 


r-i 


par-! 


It  should  be  noted  here  that  (x  ,  ...,  x  )  is  not 
actually  the  result  of  a  random  selection  from  the  cross- 
product  space 


X  xx 

100  100 


X 


since  the  family  x(  ,  .  ..,  x  already  satisfies  in¬ 

equalities  similar  to  Eq.  4.  Thus,  p2r":  is  possibly  a 
poorer  approximation  than  it  might  seem. 

Suppose  xfr>  is  the  sequence  tested  after  x(r_1) 

Then  Nr  is  a  random  variable,  and  p1-2r  approximates  the 
mean  of  Nr.  Let  N  denote  the  number  of  sequences  tested 
in  order  to  select  the  entire  family  x(1>,  x^2),  ...,  x^J. 

N  is  also  a  random  variable,  and 


N  =  N  +  N  +  ...  +  N, . 
12  r 


(5) 


4 
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Since  the  mean  of  N  is  the  sum  of  the  means  of  the  N-j_ , 
we  have  the  following  approximation  to  mn>  where  n i  is 
the  mean  of  N^. 


.  ,  -l  .  -  3  .  ,  l -af 

+  • • •  +  Mf  “  P  +  P  +  • • •  4  P 

=  p-1(l+p-2+...+p-J(f-,,) 


-p‘-f 

'  1-P V 


2  f 


Since  p  will  in  general  be  very  small,  we  have 


„N  _  P1-',(i.p,>-1 


(6) 


(7) 


Table  3  shows  estimates  of  hn  based  on  approximation 
(Eq.  7)  for  three  values  of  b.  In  the  table  p  is  the 
probability  that  m(x)  i  b  as  estimated  by  a  sample  of  500 
members  of  X 

100 


TABLE  3. 


b 

Pb 

N 

15 

0.026 

9.25  x  1046 

20 

0.464 

CD 

O 

»H 

X 

CO 

cn 

in 

25 

0 . 840 

l 

5.34  x  102 

1  -  -  

On  the  basis  of  Table  3  it  was  decided  to  place  the 
bound  b  for  families  of  sequences  from  X100  at  25. 

Table  A  shows  the  results  of  ten  computer  runs  each  select¬ 
ing  a  family  of  15  sequences  with  auto-  and  cross-correlation 
bounded  by  25.  At  the  bottom  of  Table  4  the  means  of  the 
sample  N’s  and  running  times  are  given.  In  view  of  the 
various  assumptions  made  in  approximating  N  in  Table  3  the 
difference  between  the  approximation  534  for  b  =  25  in 
Table  3,  and  the  sample  of  ip20  in  Table  4  cannot  be 

considered  too  surprising. 
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TABLE  4 


N 

Running 

time 

1,582 

9  min  55  sec 

971 

6  min  34  sec 

563 

3  min  55  sec 

934 

6  min  6  sec 

1,678 

10  min  22  sec 

824 

5  min  18  sec 

1,561 

9  min  26  sec 

798 

5  min  7  sec 

919 

5  min  58  sec 

368 

2  min  41  sec 

1,020 

6  min  32  sec 
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